Verification Based Solution for Structured MAB Problems

نویسنده

  • Zohar S. Karnin
چکیده

We consider the problem of finding the best arm in a stochastic Multi-armed Bandit (MAB) game and propose a general framework based on verification that applies to multiple well-motivated generalizations of the classic MAB problem. In these generalizations, additional structure is known in advance, causing the task of verifying the optimality of a candidate to be easier than discovering the best arm. Our results are focused on the scenario where the failure probability must be very low; we essentially show that in this high confidence regime, identifying the best arm is as easy as the task of verification. We demonstrate the effectiveness of our framework by applying it, and matching or improving the state-of-the art results in the problems of: Linear bandits, Dueling bandits with the Condorcet assumption, Copeland dueling bandits, Unimodal bandits and Graphical bandits.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Potentiometric Study of Binary and Mixed Complexes of Imidazole, Histamine, Histidine and Diacetyl Monooxime with Some Transition Metal Ions in Aqueous Solution

The complexation reactions between diacetyl monooxime (Damo), imidazole (Him), histamine (Hist) and histidine (His) with Co2+, Ni2+ and Cu2+ were studied potentiometrically in aqueous solution at 25 °C and m= 0.5 M KNO3. The overall stability constants log b's of species were obtained by computer refinement of p...

متن کامل

A Numerical Method For Solving Physiology Problems By Shifted Chebyshev Operational Matrix

In this study, a numerical solution of singular nonlinear differential equations, stemming from biology and physiology problems, is proposed. The methodology is based on the shifted Chebyshev polynomials operational matrix of derivative and collocation. To assess the accuracy of the method, five numerical problems, such as the human head, Oxygen diffusion and Bessel differential equation, were ...

متن کامل

Verification of an Evolutionary-based Wavelet Neural Network Model for Nonlinear Function Approximation

Nonlinear function approximation is one of the most important tasks in system analysis and identification. Several models have been presented to achieve an accurate approximation on nonlinear mathematics functions. However, the majority of the models are specific to certain problems and systems. In this paper, an evolutionary-based wavelet neural network model is proposed for structure definiti...

متن کامل

Verification of a Quality Management Theory: Using a Delphi Study

Background A model of quality management called Strategic Collaborative Quality Management (SCQM) model was developed based on the quality management literature review, the findings of a survey on quality management assessment in healthcare organisations, semi-structured interviews with healthcare stakeholders, and a Delphi study on healthcare quality management experts. The purpose of this stu...

متن کامل

A New Approach for Solving Fully Fuzzy Bilevel Linear Programming Problems

This paper addresses a type of fully fuzzy bilevel linear programming (FFBLP) wherein all the coefficients and decision variables in both the objective function and constraints are triangular fuzzy numbers. This paper proposes a new simple-structured, efficient method for FFBLP problems based on crisp bilevel programming that yields fuzzy optimal solutions with unconstraint variables and parame...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016